Pixel Difference Convolutional Network for RGB-D Semantic Segmentation
نویسندگان
چکیده
RGB-D semantic segmentation can be advanced with convolutional neural networks due to the availability of Depth data. Although objects cannot easily discriminated by just 2D appearance, local pixel difference and geometric patterns in Depth, they well separated some cases. Considering fixed grid kernel structure, CNNs are limited lack ability capture detailed, fine-grained information thus achieve accurate pixel-level segmentation. To solve this problem CNN we propose a Pixel Difference Convolutional Network (PDCNet) detailed intrinsic aggregating both intensity gradient range for data global RGB data, respectively. Precisely, PDCNet consists branch an branch. For branch, Convolution (PDC) consider via information. contribute lightweight Cascade Large Kernel (CLK) extend PDC, namely CPDC, enjoy contexts further boost performance. Consequently, differences from modal seamlessly incorporated into during propagation process. Experiments on three challenging benchmark datasets, i.e ., NYUDv2 (78.4 Acc., 53.5 mIoU), SUN (83.3 49.6 mIoU) SID Dataset (83.1 61.4 reveal that our achieves state-of-the-art performance task.
منابع مشابه
Dense RGB-D semantic mapping with Pixel-Voxel neural network
For intelligent robotics applications, extending 3D mapping to 3D semantic mapping enables robots to, not only localize themselves with respect to the scene’s geometrical features but also simultaneously understand the higher level meaning of the scene contexts. Most previous methods focus on geometric 3D reconstruction and scene understanding independently notwithstanding the fact that joint e...
متن کاملTraining Bit Fully Convolutional Network for Fast Semantic Segmentation
Fully convolutional neural networks give accurate, per-pixel prediction for input images and have applications like semantic segmentation. However, a typical FCN usually requires lots of floating point computation and large run-time memory, which effectively limits its usability. We propose a method to train Bit Fully Convolution Network (BFCN), a fully convolutional neural network that has low...
متن کاملDepth-aware CNN for RGB-D Segmentation
Convolutional neural networks (CNN) are limited by the lack of capability to handle geometric information due to the fixed grid kernel structure. The availability of depth data enables progress in RGB-D semantic segmentation with CNNs. State-of-the-art methods either use depth as additional images or process spatial information in 3D volumes or point clouds. These methods suffer from high compu...
متن کاملPer-Pixel Feedback for improving Semantic Segmentation
Semantic segmentation is the task of assigning a label to each pixel in the image.In recent years, deep convolutional neural networks have been driving advances in multiple tasks related to cognition. Although, DCNNs have resulted in unprecedented visual recognition performances, they offer little transparency. To understand how DCNN based models work at the task of semantic segmentation, we tr...
متن کاملA hierarchical Convolutional Neural Network for Segmentation of Stroke Lesion in 3D Brain MRI
Introduction: Brain tumors such as glioma are among the most aggressive lesions, which result in a very short life expectancy in patients. Image segmentation is highly essential in medical image analysis with applications, particularly in clinical practices to treat brain tumors. Accurate segmentation of magnetic resonance data is crucial for diagnostic purposes, planning surgical treatments, a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Circuits and Systems for Video Technology
سال: 2023
ISSN: ['1051-8215', '1558-2205']
DOI: https://doi.org/10.1109/tcsvt.2023.3296162